Description

[0001] This invention relates to DNA sequences encoding novel biosynthetic insulin precursors and to preparation 
of insulin by culturing transformed cells comprising such DNA sequences. 
[0002] Insulin is a hormone whose primary role is to control the transport of glucose from the bloodstream into cells
where it is metabolized.

[0003] Insulin consists of two chains of amino acids: A-chain consisting of 21 amino acids and B-chain consisting of 
30 amino acids, the chains being linked with each other by two disulfide bridges. A third disulfide bridge is to be found 
within the A-chain. 

[0004] Insulin is first synthesized as a nascent polypeptide chain named preproinsulin,
which consists of a signal peptide and proinsulin. The signal peptide directs the nascent chain across the membrane of the
endoplasmic reticulum. Proinsulin is produced from preproinsulin in the lumen of the endoplasmic reticulum as soon as the
polypeptide chain crosses the membrane. Proinsulin contains a chain of 35 amino acids
which is not present in insulin. This connecting peptide, or C-peptide, links the C-end of the B-chain to the N-end of
the A-chain of the future insulin molecule. From the lumen of the endoplasmic reticulum proinsulin is routed to the Golgi
apparatus, where the proteolysis of the connecting peptide starts. The C-peptide is cut out by an enzymatic
system similar to the trypsin/carboxypeptidase B system, which acts at the sequences of two consecutive basic amino acids,
and insulin is produced (Steiner et al., J. Cell Biol. 24, (1984) 121-130).

[0005] The importance of the C-peptide is that it links the B- and A-chains so as to allow a proper formation of disulfide
bridges in the insulin molecule (Bell et al., Nature 284 (1980) 26-32) or that it directs the trypsin-like converting enzyme
to process proinsulin at the site with two basic amino acids (Thim et al., PNAS 83 (1986) 6766-6770).

[0006] The production of biosynthetic human insulin has been reported mainly in the bacterium Escherichia coli and
the yeast Saccharomyces cerevisiae. In all these cases the DNA sequence encodes either the entire
proinsulin, a modified part of proinsulin, or the A- and B-chains separately.
[0007] In the bacterium E. coli insulin has been produced either via separate A- and B-chains (Chance et al., Diabetes
Care 4 (1982) 147-154) or as proinsulin (EP 55945).

[0008] In the yeast S. cerevisiae insulin has been produced via proinsulin, yet with a very low yield, EP-A-121 884. 
Various modifications of C-peptide improve the yield and the proper formation of the insulin molecule in the yeast. 
[0009] Insulin precursors of formula B-X-A, wherein B and A represent the B- and A-chains and X represents a peptide
comprising from 2 to 35 amino acids, are described in DK-A-5284/87.

[0010] Insulin precursors of formula B-X-Y-A, wherein B and A represent the B- and A-chains and X and Y each 
represent lysine or arginine, are disclosed in EP-A-195 691. 

[0011] In EP-A-163 529 insulin precursors containing the peptide chain B(1-29)-A(1-21) linked with a short peptide
having 2 to 8 amino acids are described. Modifications of des-B(30) insulin precursors of formula
B(1-29)-X1-X2-Y2-Y1-A(1-21) are described in EP-A-347 845.

[0012] In EP 249 350 glucoamylase promoter and signal sequence for secretion of proteins in fungi as hosts are 
described. As an example, an expressed proinsulin trimer was also inserted into such an expression vector. 
[0013] Many polypeptides for therapeutic or diagnostic use, e.g. erythropoietin, tPA and factor VIII, have been
N-glycosylated (Parekh et al., TIBTECH 7 (1989) 117-121).

[0014] A threefold increase in the yield of bovine chymosin (glycochymosin) in the fungus A. niger var. awamori has
also been described, achieved by the introduction of an N-glycosylation consensus site into the coding sequence. A fusion
of the prochymosin and glucoamylase genes gave an even higher yield, but a fusion of glycochymosin to glucoamylase
gave no further increase of the yield (Berka et al., Biochem. Soc. Trans. 19 (1991) 681-685).
[0015] In the case of aspartic proteinase (renin), which is of microbial origin and serves as a chymosin substitute, it
has been observed that hyperglycosylation may reduce the enzyme activity if it occurs at a critical site in the molecule
(Berka et al., Biochem. Soc. Trans. 19 (1991) 681-685; Aikawa et al., J. Biol. Chem. 265 (1990) 13955-13959).
[0016] A mutation of the glycosylated asparagine in the coding region of renin resulted in a significant reduction of
secretion in yeast cells (Aikawa et al., J. Biol. Chem. 265 (1990) 13955-13959).

[0017] The objects of the invention are DNA sequences encoding novel biosynthetic insulin precursors and the use
of eucaryotic cells, particularly fungal cells, for the preparation of insulin.

[0018] The object has been achieved by means of a DNA sequence comprising a sequence encoding insulin precursors
of the formula B-Pg-A, wherein B and A represent the B- and A-chain respectively and Pg represents a modified
C-peptide or any number of amino acids containing at least one N-glycosylation consensus site. Pg links the B- and
A-chains by proteolytic processing signals or specific chemical cleavage sites.

[0019] The proinsulin gene is preferably modified by creating the N-glycosylation site AsnXSer. This can be
achieved in the C-peptide sequence, e.g. by changing the codon GCC for the amino acid Ala-C20 to the codon AAC for Asn.
[0020] The same principle can be applied to des-B(30)-proinsulin. 

[0021] The invention is based on the finding that the yield of immunoreactive insulin, measured by the RIA method, is
much higher in transformants comprising a DNA sequence of the formula B-Pg-A fused with a protective protein than in
transformants comprising a nonmodified DNA sequence of proinsulin fused with a protective protein. Cells transformed
with a vector comprising the DNA sequence of nonfused proinsulin, or of nonfused modified proinsulin with an
N-glycosylation consensus site, give a very low yield of immunoreactive insulin, or none at all, measured by the RIA method.
[0022] Therefore the introduction of an N-glycosylation consensus site into a linker region which is not a part of the mature
insulin molecule remarkably increases the expression in fungal cells which are transformed with the DNA sequence
encoding such fused insulin precursors. Fused protein and proinsulin can either be processed in the host organism
itself or removed by a later enzymatic or chemical cleavage.

[0023] The DNA sequence encoding the said modified insulin precursor can be prepared by oligonucleotide synthesis
of the entire DNA sequence in combination with PCR, or by in vitro mutagenesis. The DNA sequence can also
be codon-optimized for expression in a suitable organism, preferably in a fungal strain.

[0024] The invention also relates to a process for the preparation of insulin by culturing suitable cells, preferably 
fungal cells, transformed with a vector comprising a DNA sequence encoding insulin precursor of the formula B-Pg-A 
in a suitable culture medium. 

[0025] The expression and secretion of the peptide of the above formula can be achieved by using a vector in which
the encoding sequence is regulated by a promoter, a signal sequence and a terminator functional in a suitable organism.
For example, these can be PglaA and the signal sequence of the glaA gene, or a part thereof suitable for expression in
Aspergillus sp. (Cullen et al., Biotechnology 5 (1987) 369-376). The expression and secretion of the peptide can also be
achieved by using an expression vector in which the DNA sequence encoding the insulin precursor of the above formula is
fused to the gene of a protective protein via a processing signal (Ward et al., Biotechnology 8 (1990) 435-440; Broekhuijsen
et al., J. Biotech. 31 (1993) 135-145) or a chemical cleavage site.

Fig. 1 shows a nucleotide sequence of both the natural and the codon-optimized proinsulin gene and the corresponding
amino acid sequence, using conventional abbreviations for nucleotides and amino acids.
Fig. 2 describes two types of human proinsulin gene expression cassettes and schematically shows the human
proinsulin gene and the mutant gene comprising an N-glycosylation consensus site.
Fig. 3 explains the strategy of construction of the human proinsulin gene and the mutant gene comprising an N-glycosylation
consensus site for two types of expression cassettes via fragments synthesized separately by "overlapping
oligonucleotides in combination with PCR".
Fig. 4 shows the 14 oligonucleotides designed for the synthesis of the human proinsulin gene and the mutant gene comprising
an N-glycosylation consensus site.
Fig. 5 explains, on the example of fragment I synthesis, the novel approach that was used for the synthesis of the
human proinsulin gene by the use of overlapping, complementary oligonucleotides in combination with PCR.
Fig. 6 schematically shows the synthesis of fragment IV (oligonucleotides are indicated by arrows only).
Fig. 7 schematically shows the synthesis of fragment II, fragment III and fragment V (oligonucleotides are indicated
by arrows only).
Fig. 8 shows the construction of plasmids pPZG301, pPZG302, pPZG303, pPZG304, pPZG305.
Fig. 9 shows the construction of plasmids pPZG311, pPZG312.
Fig. 10 shows the construction of the filamentous fungal expression vectors for expression cassettes of type I.
Fig. 11 shows the construction of the filamentous fungal expression vectors for expression cassettes of type II.
Fig. 12 shows the sequence at the GlaG2 junction of the glaA gene.

Examples 

Example 1

Synthesis of human proinsulin gene and mutant gene comprising N-glycosylation consensus site by 
overlapping oligonucleotides in combination with polymerase chain reaction (PCR) 

a) Methodology

[0026] All DNA manipulation experiments were made using standard molecular biology methods as described in 
Maniatis, T., et al., "Molecular Cloning, A Laboratory Manual", Cold Spring Harbor Laboratory, U.S.A. (1982). 
[0027] All chemicals, enzymes and plasmids, unless otherwise stated, are available from various commercial sources.
Buffers and reaction conditions for restriction endonuclease digestions were used as recommended by the manufacturer
unless otherwise indicated.

[0028] Chemical synthesis of oligonucleotides was performed, upon order, by "ISOGEN Bioscience", Amsterdam,
The Netherlands. DNA samples were supplied in a lyophilized form as sodium salts with free hydroxyl groups on the 5'
and 3' ends. Such DNA was dissolved in sterile MQ H2O. The concentration of single-stranded DNA oligonucleotides was
determined by measuring absorbance at 260 nm (A260). Samples of dissolved DNA oligonucleotides were stored at -20°C.
[0029] The polymerase chain reaction, further indicated as PCR, was performed on a Perkin Elmer Cetus (761 Main
Ave., Norwalk CT 06859) apparatus with the GeneAmp™ DNA Amplification Reagent Kit with AmpliTaq™ Recombinant
Taq DNA Polymerase. Concentrated solutions and reaction mixtures were prepared according to the "Perkin Elmer Cetus
Protocol for DNA Amplification" as recommended by the manufacturer, with the exception that the reaction mixture
contained no template DNA but only oligonucleotides in a concentration of 100 pmol per reaction (100 µl). The
temperature cycle profile for each PCR was: melting at 94°C for 1 min, annealing at 40°C for 1 min (for
fragments I and IV) or at 50°C for 1 min (for fragments II, III and V), and polymerization at 72°C for 1 min. There were 25
cycles. After the last cycle, the polymerization time was extended for 7 minutes to complete the polymerization of all chains.
Thereafter the sample was cooled to 4°C. PCR products were isolated from the reaction mixture by two steps of
chloroform extraction; the DNA was then precipitated with isopropanol, dried in vacuum and dissolved in 50 µl of TE buffer.
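The cycling conditions of paragraph [0029] can be written down as a small data structure, shown here as a minimal Python sketch. The class and field names are illustrative only and do not correspond to any instrument software; the sketch also assumes that the 7 minute extension is a separate hold added after the 25th cycle, which is one reading of the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PcrProgram:
    """Hypothetical container for the cycle profile of paragraph [0029]."""
    anneal_c: int                 # annealing temperature in deg C (40 or 50)
    cycles: int = 25
    melt_c: int = 94
    extend_c: int = 72
    step_min: int = 1             # each step within a cycle lasts 1 min
    final_extension_min: int = 7  # assumed to be an extra hold after the last cycle
    hold_c: int = 4               # sample is cooled to 4 deg C afterwards

    def steps(self):
        """Yield (temperature in deg C, minutes) for every timed step."""
        for _ in range(self.cycles):
            yield (self.melt_c, self.step_min)
            yield (self.anneal_c, self.step_min)
            yield (self.extend_c, self.step_min)
        yield (self.extend_c, self.final_extension_min)

    def programmed_minutes(self) -> int:
        """Sum of the programmed hold times, ignoring ramping between temperatures."""
        return sum(minutes for _, minutes in self.steps())


# Fragments I and IV anneal at 40 deg C; fragments II, III and V at 50 deg C.
PROGRAMS = {name: PcrProgram(anneal_c=40) for name in ("I", "IV")}
PROGRAMS.update({name: PcrProgram(anneal_c=50) for name in ("II", "III", "V")})
```

Under these assumptions each program holds for 25 x 3 min plus the 7 min extension; ramp times between temperatures are not modelled.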
[0030] The bacterium E. coli K12 JM109 (Yanisch-Perron et al., Gene 33 (1985) 103-119) was used in experiments of
transformation and amplification of plasmids. Transformation and electroporation of E. coli were carried out according to
previously described methods (Hanahan, D., J. Mol. Biol. 166 (1983) 557-580; Biorad Genepulser).

Oligonucleotide Designing 

[0031] The synthetic human proinsulin gene was derived from the published sequence of the human insulin gene
(Bell et al., Nature 284 (1980) 26-32). It should be pointed out that when synthesizing the gene, certain modifications
can be made: it is possible to use various, in most cases preferred, codons; to introduce additional functional DNA
sequences and restriction sites in order to enable ligation to expression vectors; and to change one or more codons of
the coding region in order to produce a structurally modified polypeptide, yet with substantially the same activity or utility.
[0032] In the present invention, the synthetic human proinsulin gene was modified for expression in fungi. The most
frequently used codons of the glucoamylase G1 gene of the filamentous fungus A. niger, which is expressed in
high quantities (Boel et al., EMBO J. 3 (1984) 1097-1102; Nunberg et al., Mol. Cell. Biol. 4 (1984) 2306-2315), were used.
They were in most cases the same as the human codons, except in some regions where choosing the preferred codon
would make the oligonucleotide very rich in CG nucleotides, which could cause problems in sequencing and possibly
even in PCR. Therefore some codons, mostly Gly codons, were changed from GGC to GGT, which is the second preferred Gly
codon in the glucoamylase gene of the filamentous fungus A. niger (Fig. 1).
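A codon-optimized gene must still encode the same polypeptide. The following Python sketch checks this for the B-chain codons listed in Fig. 1 (natural versus synthetic sequence), using the standard genetic code. The variable and function names are illustrative, and the codon lists are transcribed from Fig. 1 for residues B1 to B30 only.

```python
# Standard genetic code, bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# B-chain codons B1-B30 as read from Fig. 1 (upper = natural, lower = synthetic).
NATURAL = (
    "TTT GTG AAC CAA CAC CTG TGC GGC TCA CAC CTG GTG GAA GCT CTC TAC "
    "CTA GTG TGC GGG GAA CGA GGC TTC TTC TAC ACA CCC AAG ACC"
).split()
OPTIMIZED = (
    "TTC GTG AAC CAG CAC CTG TGC GGC TCC CAC CTG GTG GAG GCT CTC TAC "
    "CTG GTG TGC GGT GAG CGC GGT TTC TTC TAC ACC CCC AAG ACT"
).split()


def translate(codons):
    """Translate a list of codons into a one-letter amino acid string."""
    return "".join(CODON_TABLE[codon] for codon in codons)


def gc_fraction(codons):
    """Fraction of G and C nucleotides in the concatenated codons."""
    seq = "".join(codons)
    return (seq.count("G") + seq.count("C")) / len(seq)


# 1-based residue positions whose codon differs between the two sequences.
changed = [i + 1 for i, (n, o) in enumerate(zip(NATURAL, OPTIMIZED)) if n != o]

print(translate(NATURAL) == translate(OPTIMIZED), changed)
print(round(gc_fraction(NATURAL), 2), round(gc_fraction(OPTIMIZED), 2))
```

Both Gly codons that were changed in this stretch (B20 and B23) end up as GGT, in line with the GGC to GGT substitution described above.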

[0033] At the 5' end of the synthetic proinsulin gene, additional DNA sequences were introduced encoding either
the 18 amino acids of the glucoamylase gene (glaA) signal peptide or 6 amino acids of the glaA propeptide containing the
Lys-Arg proteolytic cleavage site, in order to make possible the construction of two types of expression cassettes. In the
first case (I) the synthetic human proinsulin gene was regulated by the glucoamylase promoter (PglaA) and the signal sequence
(ssglaA) of the glucoamylase gene, whereas in the second case (II) the proinsulin gene was fused with the entire glaA gene of the
filamentous fungus A. niger via a spacer peptide containing a KEX-2-like protein processing signal, under the control of
PglaA (Fig. 2 and Fig. 12).

[0034] A mutant of the synthetic human proinsulin gene carrying an N-glycosylation consensus site was also prepared
in order to study its influence on the expression and secretion levels. Since the C-peptide is not essential for insulin function,
because it is cleaved off in the secretion process, the mutation was introduced in the C-peptide coding region of the
human proinsulin gene (BCA). A mutant (BPgA) carrying the N-glycosylation consensus site AsnXSer was prepared by
changing the codon GCC for the amino acid Ala-C20 to the codon AAC for Asn (Fig. 2).
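The effect of the Ala-C20 to Asn change can be illustrated with a short Python sketch that scans a peptide for N-glycosylation sequons. The C-peptide string is read from Fig. 1 and, as an assumption, is numbered C1 to C35 including the flanking Arg-Arg and Lys-Arg residues, which is the numbering under which Ala falls at position 20. The pattern used is the commonly cited Asn-X-Ser/Thr consensus with X other than Pro; the text itself names only AsnXSer.

```python
import re

# Connecting peptide as read from Fig. 1, including the dibasic residues
# at both ends (35 residues; numbering C1..C35 is an assumption).
C_PEPTIDE = "RREAEDLQVGQVELGGGPGAGSLQPLALEGSLQKR"

# Asn-X-Ser/Thr with X != Pro; the lookahead keeps overlapping sites detectable.
SEQUON = re.compile(r"N(?=[^P][ST])")


def sequon_positions(peptide):
    """Return 1-based positions of Asn residues that start a sequon."""
    return [match.start() + 1 for match in SEQUON.finditer(peptide)]


def substitute(peptide, position, new_residue):
    """Replace the residue at a 1-based position."""
    index = position - 1
    return peptide[:index] + new_residue + peptide[index + 1:]


mutant = substitute(C_PEPTIDE, 20, "N")   # Ala-C20 -> Asn (GCC -> AAC)

print(sequon_positions(C_PEPTIDE))  # unmodified C-peptide
print(sequon_positions(mutant))     # mutant C-peptide
```

In this reading the unmodified C-peptide contains no Asn at all, so the single substitution creates the only sequon (Asn-Gly-Ser) in the linker region.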

[0035] For the synthesis of the human proinsulin gene a new approach was introduced, wherein overlapping,
complementary oligonucleotides in combination with PCR were used. The principle of the method was to use, during the
PCR, two middle, overlapping, complementary oligonucleotides simultaneously as a template and as primers, thus
enabling Taq DNA polymerase to "enlarge" them, i.e. to elongate them, during the first cycle. In the second cycle such
"enlarged" oligonucleotides overlap with the neighbouring 3' or 5' oligonucleotide, thus enabling Taq DNA polymerase to
"additionally enlarge" the already "enlarged" middle oligonucleotides. Taq DNA polymerase thus actually fills the gaps
between chemically synthesized, overlapping oligonucleotides and creates a DNA double helix. The synthesis of the human
proinsulin gene and its mutant comprising an N-glycosylation consensus site was performed in fragments which were
synthesized separately by PCR. The division of the genes into fragments was made possible by the presence of two
cleavage sites for the restriction enzyme PstI in the C-peptide coding region (Fig. 3). Fourteen oligonucleotides were designed:
ten oligonucleotides (B3, B4, B5, B6, C1, C2, C3, A1, A2 and A3) were deduced from the human proinsulin DNA
encoding sequence with preferred codons for expression in the filamentous fungus A. niger; two oligonucleotides (B1 and
B2) were designed for the introduction of the glaA gene signal sequence and additional restriction sites making possible
ligation into the type I expression vector; one oligonucleotide (B7) was designed for the introduction of the glaA (G2) gene
sequence spacer region and additional restriction sites making possible ligation with the type II expression vector; and
one oligonucleotide (C4) was designed for the introduction of the mutation carrying the N-glycosylation consensus site (Fig. 4).
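The gap-filling principle of paragraph [0035] can be mimicked in a few lines of Python. This is a toy model only: the target is the first 16 codons of the synthetic B-chain sequence from Fig. 1, while the four oligonucleotides, their split points and the minimum overlap of 6 nucleotides are hypothetical and are not the B1 to B7 oligonucleotides of Fig. 4. Annealing is modelled as exact matching of overlapping ends, with no mismatches or competing products.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq: str) -> str:
    """Reverse complement of a DNA strand written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]


def join_top(left: str, right: str, min_overlap: int = 6) -> str:
    """Merge two top-strand sequences where the end of `left` repeats the start of `right`."""
    for k in range(min(len(left), len(right)), min_overlap - 1, -1):
        if left[-k:] == right[:k]:
            return left + right[k:]
    raise ValueError("oligonucleotides do not overlap sufficiently")


def fill_in(top: str, bottom: str, min_overlap: int = 6) -> str:
    """Top strand of the duplex obtained when a top-strand and a bottom-strand
    oligonucleotide anneal at their 3' ends and the polymerase extends both."""
    return join_top(top, revcomp(bottom), min_overlap)


# First 16 codons of the synthetic B-chain sequence (Fig. 1, lower sequence).
TARGET = "TTCGTGAACCAGCACCTGTGCGGCTCCCACCTGGTGGAGGCTCTCTAC"

# Hypothetical oligonucleotides: two middle ones and two outer ones.
middle_top = TARGET[10:30]
middle_bottom = revcomp(TARGET[22:40])
outer_top = TARGET[:18]
outer_bottom = revcomp(TARGET[34:])

# Step 1: the middle pair serves as template and primer at the same time.
step1 = fill_in(middle_top, middle_bottom)
# Step 2: the enlarged product overlaps the outer oligonucleotides and is enlarged again.
step2 = fill_in(join_top(outer_top, step1), outer_bottom)

print(step2 == TARGET)
```

The two calls correspond to the first and second "step" of a step-by-step synthesis; adding a further pair of outer oligonucleotides would repeat the second call.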
b) PCR synthesis of fragments I - V
Fragments I and IV 

[0036] Fragments I and IV were synthesized "step by step". In the first step of PCR, a reaction was performed with
oligonucleotides B3 and B4, resulting in the product B3/B4. In the following step, 1 µl of the product B3/B4 was mixed
in the PCR reaction mixture with: a) oligonucleotides B2 and B5 for fragment I, and b) oligonucleotides B7 and B5 for
fragment IV. Products B2/B3/B4/B5 and B7/B3/B4/B5 were obtained. In the third step, 1 µl of the product B2/B3/B4/B5
was used for PCR with oligonucleotides B1 and B6 for fragment I, and 1 µl of the product B7/B3/B4/B5 was mixed
in the PCR reaction mixture with oligonucleotides B7 and B6 for fragment IV. PCR products B1/B2/B3/B4/B5/B6 and
B7/B3/B4/B5/B6 were obtained. Their size corresponded to the expected one, and the products were digested with
appropriate restriction enzymes, resulting in fragment I and fragment IV (Figs. 5 and 6).

Fragments II, III and V 


[0037] Fragments II, III and V were synthesized in one step. PCR was performed with oligonucleotides: a) C1, C2,
C3; b) A1, A2, A3; and c) C1, C4, C3. PCR products C1/C2/C3, A1/A2/A3 and C1/C4/C3 were of the expected size and
were digested with appropriate restriction enzymes, resulting in fragment II, fragment III and fragment V (Fig. 7).

Cloning of the PCR-synthesized fragments into pUC19 vector and DNA sequence analysis

[0038] The plasmid pUC19 was digested with: a) PstI, b) PstI/EcoRI, and c) PstI/SalI restriction enzymes and ligated
with the appropriate fragments I-V, resulting in plasmids pPZG301, pPZG302, pPZG303, pPZG304 and pPZG305 (Fig.
8). The bacterium E. coli JM109 was transformed with these plasmids. Plasmid DNA (pDNA) was isolated from the
obtained transformants by "mini preparation" and analyzed with restriction enzymes. E. coli transformants containing
correct plasmids were grown, and their pDNA was isolated on a "QIAGEN-tip 20" column and sequenced with the
"SEQUENASE® Version 2.0 kit". Vectors carrying a correctly synthesized and inserted sequence were used for further
constructions.

pPZG311 and pPZG312

[0039] Vectors carrying the complete human proinsulin gene, or the mutant gene with the N-glycosylation consensus site,
together with the glaA gene signal sequence were constructed as follows: pPZG311 was constructed by ligation of the 173 bp
EcoRI-PstI fragment isolated from pPZG301, the 75 bp PstI-PstI fragment isolated from pPZG302, and the 74 bp PstI-SalI
fragment isolated from pPZG303 into the EcoRI/SalI-digested pUC19 vector. pPZG312 was constructed by ligation of the
173 bp EcoRI-PstI fragment isolated from pPZG301, the 75 bp PstI-PstI fragment isolated from pPZG305, and the 74 bp
PstI-SalI fragment isolated from pPZG303 into the EcoRI/SalI-digested pUC19 vector (Fig. 9).

Example 2 

Construction of filamentous fungal expression vector
I. Expression cassette type I 
pAN52-7NotBCA and pAN52-7NotBPgA 


[0040] pAN52-7NotglaA (Verdoes et al., Gene (1994) in press) was digested partially with NcoI and with SalI. The 7.55
kb SalI-NcoI fragment carrying the glaA promoter and the trpC gene termination region was isolated and ligated with the 317 bp
NcoI-SalI fragment of plasmid pPZG311, resulting in plasmid pAN52-7NotBCA, or with the 317 bp NcoI-SalI fragment of plasmid
pPZG312, resulting in plasmid pAN52-7NotBPgA (Fig. 10).


pAN52-7BCAamdS and pAN52-7BPgAamdS 

[0041] The 5.0 kb NotI fragment carrying the acetamidase (amdS) gene from A. nidulans was ligated into the NotI site of
plasmids pAN52-7NotBCA and pAN52-7NotBPgA, resulting in plasmids pAN52-7BCAamdS and pAN52-7BPgAamdS
(Fig. 10).
II. Expression cassette type II
pAN56-7BCAamdS and pAN56-7BPgAamdS

[0042] Vectors for the fused glucoamylase-proinsulin protein and the glucoamylase-proinsulin mutant with the N-glycosylation
consensus site were constructed as follows: plasmid pPZG304 was digested with EcoRV and AlwNI. The 25 bp
EcoRV-AlwNI fragment carrying the sequence for the 6 amino acid spacer was ligated with the 3.07 kb SalI-EcoRV fragment from
pAN56-7 carrying the glaA(G2) gene, resulting in the fragment SalI-AlwNI. Vector pAN56-7 was obtained from vector pAN56-4
(Broekhuijsen et al., J. Biotech. 31 (1993) 135-145), in which the gpdA promoter was replaced with the 4.2 kb NotI-NcoI
fragment from pAN52-7 carrying the glaA promoter, and from which the interleukin gene was removed, leaving an EcoRV site
upstream from GlaG2. The fragment AlwNI-SalI was then ligated with the 10.6 kb HindIII-SalI fragment from pAB1-6S (Den
Herder et al., Mol. Gen. Genet. 233 (1992) 404-410), which has the glaA promoter and the amdS gene, and with a) the 1.06 kb
AlwNI-HindIII fragment from pAN52-7BCA, resulting in plasmid pAN56-7BCAamdS, or b) the 1.06 kb AlwNI-HindIII
fragment from pAN52-7BPgA, resulting in vector pAN56-7BPgAamdS (Fig. 11). In the spacer there is a dibasic processing
site, Lys-Arg. The sequence around the fusion site of the glucoamylase gene with the human proinsulin gene (BCA) and the
proinsulin mutant (BPgA) by means of the spacer region is shown in Fig. 12.

Example 3 

Transformation of filamentous fungus Aspergillus niger

[0043] The transformation of the filamentous fungus Aspergillus niger was carried out according to previously described
methods (Goosen et al., Curr. Genet. 11 (1987) 499-503; Kelly and Hynes, EMBO J. 4 (1985) 475-479; Van Hartingsveldt
et al., Mol. Gen. Genet. 206 (1987) 71-75).
[0044] The filamentous fungus A. niger N402 (Bos, C.J. et al., Curr. Genet. 14 (1988) 437-443) was transformed with the
expression vectors pAN52-7BCAamdS, pAN52-7BPgAamdS, pAN56-7BCAamdS and pAN56-7BPgAamdS. Colonies
were selected on culture media with acetamide or acrylamide as the sole carbon and nitrogen source. Starting from
single spores, a further analysis of transformants was performed.

Example 4

Secretion of human insulin into the culture medium 

1. Culturing of A. niger transformants carrying human proinsulin gene and mutant gene with N-glycosylation consensus
site

[0045] About 10^8 spores of selected transformants were inoculated into 500 ml Erlenmeyer flasks containing 100 ml
of liquid culture medium containing: 70 mM NaNO3, 7 mM KCl, 11 mM KH2PO4, 2 mM MgSO4, 4% dextrin, trace
elements (1:1000; 76 mM ZnSO4, 178 mM H3BO3, 25 mM MnCl2, 18 mM FeSO4, 7.1 mM CoCl2, 6.4 mM CuSO4, 6.2
mM Na2MoO4, 174 mM EDTA) and vitamins (1:1000; 100 mg/L of thiamine, 100 mg/L of riboflavin, 100 mg/L of
nicotinamide, 50 mg/L of pyridoxine, 10 mg/L of pantothenic acid, 0.2 mg/L of biotin). Transformants were grown at 32°C
for 36 hours. Samples were analysed for the presence of insulin.
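The following Python sketch works through the arithmetic of the 1:1000 additions. It rests on an assumption that is not stated explicitly in the text: that the listed trace element concentrations are those of a stock solution which is diluted 1:1000 into the medium. The function names are illustrative.

```python
# Trace element concentrations as listed in paragraph [0045], in mM.
# Assumption: these describe a stock solution added at a 1:1000 dilution.
TRACE_STOCK_MM = {
    "ZnSO4": 76, "H3BO3": 178, "MnCl2": 25, "FeSO4": 18,
    "CoCl2": 7.1, "CuSO4": 6.4, "Na2MoO4": 6.2, "EDTA": 174,
}
DILUTION = 1000
CULTURE_ML = 100  # medium volume per 500 ml Erlenmeyer flask


def final_micromolar(stock_mm: float, dilution: int = DILUTION) -> float:
    """Final concentration in micromolar after diluting a stock given in mM."""
    return stock_mm * 1000 / dilution


def stock_volume_ul(culture_ml: float, dilution: int = DILUTION) -> float:
    """Volume of stock in microlitres needed for a given culture volume in ml."""
    return culture_ml * 1000 / dilution


for salt, stock in TRACE_STOCK_MM.items():
    print(f"{salt}: {final_micromolar(stock):g} uM final")
print(f"stock per flask: {stock_volume_ul(CULTURE_ML):g} ul")
```

Under this reading, a 1:1000 dilution simply converts each millimolar stock value into the same number of micromolar in the medium.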

2. Expression of insulin 


[0046] Expression of immunoreactive human insulin was demonstrated with the "Pharmacia INSULIN RIA 100" kit, Kabi
Pharmacia Diagnostics. Samples of transformants cultured by fermentation were filtered; the filtrates were neutralized,
diluted if necessary and then submitted to RIA (radioimmunoassay) determination (Livessy et al., Clin. Biochem. 13
(1980) 151-55). The expression level of immunoreactive human insulin (mU/L) in A. niger transformants is shown in
Table 1.



Table 1

Expression level of immunoreactive human insulin in various A. niger transformants:

Transformant                          Plasmid            Immunoreactive insulin (mU/L)
A. niger 7650 [NCAIM (P) F 001215]    pAN52-7BCAamdS
A. niger 7651 [NCAIM (P) F 001216]    pAN52-7BPgAamdS
A. niger 7649 [NCAIM (P) F 001214]    pAN56-7BCAamdS     56
A. niger 7638 [NCAIM (P) F 001213]    pAN56-7BPgAamdS    776



[0047] Transformants A. niger 7650, A. niger 7651, A. niger 7649 and A. niger 7638 were deposited in the National
Collection of Agricultural and Industrial Microorganisms, H-1502 Budapest, P.O.B. 53, Hungary, on July 8, 1994 under
accession numbers NCAIM (P) F 001215, NCAIM (P) F 001216, NCAIM (P) F 001214 and NCAIM (P) F 001213,
in accordance with the Budapest Treaty on the International Recognition of the Deposit of Microorganisms for the
Purposes of Patent Procedure, Statement in the Case of an Original Deposit, pursuant to Rule 6.1.
[0048] From the Table it is evident that there is a distinct difference in expression between the transformants containing
the various vectors. By far the best result (776 mU/L) is shown by the transformant obtained by transformation with the
vector having the human proinsulin gene carrying the N-glycosylation consensus site fused with the glucoamylase protein.
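The size of the difference between the two glucoamylase fusion constructs can be expressed as a ratio. The short Python sketch below uses only the two values given in Table 1; the dictionary name is illustrative.

```python
# Immunoreactive insulin (mU/L) for the two glucoamylase fusion constructs in Table 1.
INSULIN_MU_PER_L = {
    "pAN56-7BCAamdS": 56,    # nonmodified proinsulin fused with glucoamylase
    "pAN56-7BPgAamdS": 776,  # proinsulin with N-glycosylation site, fused
}

fold_increase = INSULIN_MU_PER_L["pAN56-7BPgAamdS"] / INSULIN_MU_PER_L["pAN56-7BCAamdS"]
print(f"fold increase: {fold_increase:.1f}")
```

This is a single comparison of two reported values and carries no information about variation between cultures.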
Claims

1. DNA sequence comprising a sequence encoding insulin precursor of formula B-Pg-A, wherein B and A represent
B- and A-chains of human insulin, respectively, Pg represents modified C-peptide or any number of amino acids
comprising at least one N-glycosylation consensus site, and Pg links B- and A-chains by proteolytic processing
signals or specific chemical cleavage sites.

2. DNA sequence comprising a sequence encoding insulin precursor of formula B(29)-Pg-A, wherein B(29)
represents amino acids 1-29 of B-chain of human insulin, A represents A-chain of human insulin, Pg represents modified
C-peptide or any number of amino acids comprising at least one N-glycosylation consensus site and Pg links B-
and A-chains by proteolytic processing signals or specific chemical cleavage sites.

3. DNA sequence according to claim 1 or 2 with preferred codons for expression in particular hosts. 

4. DNA sequence according to any one of claims 1 to 3 fused with a protective protein gene by means of a spacer
region carrying a proteolytic processing signal or specific chemical cleavage site.

5. Transformed cells of a suitable host comprising a DNA sequence according to any one of claims 1 to 4.

6. Transformed fungal cells comprising a DNA sequence according to any one of claims 1 to 4.

7. Transformed Aspergillus sp. fungal cells comprising a DNA sequence according to any one of claims 1 to 4.

8. A process for the preparation of insulin by culturing suitable cells, preferably fungal cells, transformed with a vector
comprising a DNA sequence according to any one of claims 1 to 4.



Claims (German version)

1. DNA sequence containing a sequence encoding an insulin precursor of the formula B-Pg-A, wherein B and A represent
the B- and A-chains of human insulin, respectively, Pg represents a modified C-peptide or any number of amino acids
comprising at least one N-glycosylation consensus site, and Pg links the B- and A-chains by
proteolytic processing signals or specific chemical cleavage sites.

2. DNA sequence containing a sequence encoding an insulin precursor of the formula B(29)-Pg-A, wherein B(29) represents
amino acids 1-29 of the B-chain of human insulin, A represents an A-chain of human insulin,
Pg represents a modified C-peptide or any number of amino acids comprising at least one N-glycosylation consensus
site, and Pg links the B- and A-chains by proteolytic processing signals or
specific chemical cleavage sites.

3. DNA sequence according to claim 1 or 2 with preferred codons for expression in particular hosts.

4. DNA sequence according to any one of claims 1 to 3, fused with a protective protein gene by means of a spacer
region carrying a proteolytic processing signal or a specific chemical cleavage site.

5. Transformed cells of a suitable host comprising a DNA sequence according to any one of claims 1 to 4.

6. Transformed fungal cells containing a DNA sequence according to any one of claims 1 to 4.

7. Transformed Aspergillus sp. fungal cells containing a DNA sequence according to any one of claims 1 to 4.

8. Process for the preparation of insulin by propagating suitable cells, preferably fungal cells, transformed
with a vector comprising a DNA sequence according to any one of claims 1 to 4.



Claims (French version)

1. DNA sequence comprising a sequence coding for an insulin precursor of formula B-Pg-A, in which
B and A represent the B- and A-chains of human insulin, respectively, Pg represents a modified C-peptide
or any number of amino acids comprising at least one N-glycosylation consensus site,
and Pg links the B- and A-chains by proteolytic processing signals or specific chemical cleavage sites.

2. DNA sequence comprising a sequence coding for an insulin precursor of formula B(29)-Pg-A, in
which B(29) represents amino acids 1-29 of the B-chain of human insulin, A represents the A-chain
of human insulin, Pg represents a modified C-peptide or any number of amino acids comprising
at least one N-glycosylation consensus site, and Pg links the B- and A-chains by proteolytic processing
signals or specific chemical cleavage sites.

3. DNA sequence according to either of claims 1 and 2, with preferred codons for expression
in particular hosts.

4. DNA sequence according to any one of claims 1 to 3, fused to a protective protein gene
by means of a spacer region carrying a proteolytic processing signal or specific chemical cleavage
site.

5. Transformed cells of a suitable host comprising a DNA sequence according to any one of
claims 1 to 4.

6. Transformed fungal cells comprising a DNA sequence according to any one of claims
1 to 4.

7. Transformed Aspergillus sp. fungal cells comprising a DNA sequence according to any one of
claims 1 to 4.

8. Process for the preparation of insulin by culturing suitable cells, preferably fungal cells,
transformed with a vector comprising a DNA sequence according to any one of claims 1 to 4.

B-chain

Phe Val Asn Gln His Leu Cys Gly
TTT GTG AAC CAA CAC CTG TGC GGC
  X           X
TTC GTG AAC CAG CAC CTG TGC GGC

Ser His Leu Val Glu Ala Leu Tyr
TCA CAC CTG GTG GAA GCT CTC TAC
  X               X
TCC CAC CTG GTG GAG GCT CTC TAC

Leu Val Cys Gly Glu Arg Gly Phe
CTA GTG TGC GGG GAA CGA GGC TTC
  X           X   X   X   X
CTG GTG TGC GGT GAG CGC GGT TTC

Phe Tyr Thr Pro Lys Thr Arg Arg
TTC TAC ACA CCC AAG ACC CGC CGG
          X           X   X   X
TTC TAC ACC CCC AAG ACT CGT CGC

C-chain

Glu Ala Glu Asp Leu Gln Val Gly Gln Val Glu Leu Gly Gly Gly Pro
GAG GCA GAG GAC CTG CAG GTG GGG CAG GTG GAG CTG GGC GGG GGC CCT
      X                       X                   X   X   X   X
GAG GCT GAG GAC CTG CAG GTG GGC CAG GTG GAG CTG GGT GGT GGT CCC

   (Asn)
Gly Ala Gly Ser Leu Gln Pro Leu
GGT GCA GGC AGC CTG CAG CCC TTG
      X           X         X
GGT GCC GGC AGC CTC CAG CCC CTG
   (AAC)

Ala Leu Glu Gly Ser Leu Gln Lys
GCC CTG GAG GGG TCC CTG CAG AAG
              X
GCC CTG GAG GGC TCC CTG CAG AAG

A-chain

Arg Gly Ile Val Glu Gln Cys Cys Thr Ser Ile Cys Ser Leu Tyr Gln
CGT GGC ATT GTG GAA CAA TGC TGT ACC AGC ATC TGC TCC CTC TAC CAG
  X       X       X   X       X     XX
CGC GGC ATC GTG GAG CAG TGC TGC ACC TCC ATC TGC TCC CTC TAC CAG

Leu Glu Asn Tyr Cys Asn (End)
CTG GAG AAC TAC TGC AAC TAG

CTG GAG AAC TAC TGC AAC TAG

Upper nucleotide sequence shows natural human insulin coding sequence.
Lower nucleotide sequence shows synthetic, codon-optimised human insulin region.
Bold characters represent A- and B-chains of human insulin.
X shows changes in codons resulting from the use of Aspergillus preferred codons.
( ) shows introduced mutation carrying N-glycosylation consensus site AsnXSer.
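
The two properties the figure legend describes can be checked mechanically. The following is a minimal sketch, not part of the patent: it takes the first B-chain row of the figure (natural and codon-optimised), confirms that both encode the same amino acids, lists the codon positions that differ, and tests a short C-peptide stretch for the AsnXSer consensus site before and after the Ala to Asn replacement. The names `CODON_TABLE`, `translate`, `changed_positions` and `has_asn_x_ser` are hypothetical, and the codon table is deliberately limited to the codons used here.

```python
# Illustrative sketch only; all names are assumptions, not taken from the patent.
# Partial codon table: just the codons that occur in the first B-chain row.
CODON_TABLE = {
    "TTT": "Phe", "TTC": "Phe",
    "GTG": "Val",
    "AAC": "Asn",
    "CAA": "Gln", "CAG": "Gln",
    "CAC": "His",
    "CTG": "Leu",
    "TGC": "Cys",
    "GGC": "Gly",
}

# First B-chain row of the figure: natural (upper) and codon-optimised (lower).
natural = "TTT GTG AAC CAA CAC CTG TGC GGC".split()
optimised = "TTC GTG AAC CAG CAC CTG TGC GGC".split()


def translate(codons):
    """Map a list of codons to three-letter amino acid names."""
    return [CODON_TABLE[codon] for codon in codons]


def changed_positions(upper, lower):
    """Return 1-based positions of codons that differ between two rows."""
    return [i + 1 for i, (a, b) in enumerate(zip(upper, lower)) if a != b]


def has_asn_x_ser(residues):
    """True if the residue list contains Asn, any residue, then Ser."""
    return any(
        residues[i] == "Asn" and residues[i + 2] == "Ser"
        for i in range(len(residues) - 2)
    )


# C-peptide stretch around the mutation, as read from the figure.
c_peptide_native = ["Gly", "Ala", "Gly", "Ser"]
c_peptide_mutant = ["Gly", "Asn", "Gly", "Ser"]

print(translate(natural) == translate(optimised))
print(changed_positions(natural, optimised))
print(has_asn_x_ser(c_peptide_native), has_asn_x_ser(c_peptide_mutant))
```

Restricting the table to the codons in use keeps the sketch short; a full genetic code table would be needed to process the remaining rows.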
Two types of expression cassettes for human proinsulin:

i) PglaA | ssglaA | h-proinsulin | TtrpC

          Val Cys Thr Gly Leu Ala  Phe Val Asn Gln His Leu
   ssglaA GTC TGC ACA GGG TTG GCA  TTC GTG AAC CAG CAC CTG h-proinsulin

ii) KEX-2

        Pro Thr Ala Asn Val Ile Ser Lys Arg  Phe Val Asn
   glaA CCC ACG GCG AAT GTG ATA TCC AAG CGC  TTC GTG AAC h-proinsulin

Schematic representation of human proinsulin gene and its mutant carrying
N-glycosylation consensus site

1) BCA: B-chain, C-peptide, A-chain

2) BPgA: B-chain, C-peptide carrying the Ala→Asn replacement, A-chain
Strategy for oligonucleotide designing

[Figure: schematic of the proinsulin coding region (B-chain, C-peptide, A-chain) covered by synthetic oligonucleotides of the B, C and A series and assembled as Fragment I, Fragment II, Fragment III and Fragment IV. Annotations on the drawing: 18 amino acids of the glaA gene signal sequence; 6 amino acids of the spacer propeptide; amino acid C-20, Ala→Asn.]
Fig. 4.

OligoB1: 43-mer
5'-CCGGAATTCC ATGGCGTTCC GATCTCTACT CGCCCTGAGC GGC-3'
EcoRI NcoI

OligoB2: 41-mer
5'-CCTGAGCGGC CTCGTCTGCA CAGGGTTGGC ATTCGTGAAC C-3'

OligoB3: 38-mer
5'-TTCGTGAACC AGCACCTGTG CGGCTCCCAC CTGGTGGA-3'
AlwNI

OligoB4: 36-mer
5'-GCTCACCGCA CACGAGGTAG AGAGCCTCCA CCAGGT-3'

OligoB5: 36-mer
5'-AGTCTTGGGG GTGTAGAAGA AACCGCGCTC ACCGCA-3'

OligoB6: 40-mer
5'-GGAATTCTGC AGGTCCTCAG GCTCGCGACG AGTCTTGGGG-3'

OligoB7:
5'-[illegible]GAATTCG ATATCCAAGC GCTTCGTGAA C-3'
EcoRI EcoRV

OligoC1: 41-mer
5'-GGAATTCTGC AGGTGGGCCA GGTGGAGCTG GGTGGTGGTC C-3'
EcoRI PstI

OligoC2: 33-mer
5'-GGGGCTGGAG GCTGCCGGCA CCGGGACCAC CAC-3'
NaeI

OligoC3: 39-mer
5'-GGAATTCTGC AGGGAGCCCT CCAGGGCCAG GGGCTGGAG-3'
EcoRI PstI

OligoC4: 33-mer (Asn)
5'-GGGGCTGGAG GCTGCCGTTA CCGGGACCAC CAC-3'

OligoA1: 41-mer
5'-GGAATTCTGC AGAAGCGCGG CATCGTGGAG CAGTGCTGCA
EcoRI PstI

OligoA2: 40-mer
5'-TTCTCCAGCT GGTAGAGGGA GCAGATGGAG GTGCAGCA[illegible]

OligoA3: 42-mer (Stop)
5'-[partly illegible]TGCAGT AGTTCTCCAG CT-3'
EcoRI BamHI

Enzyme names under a sequence show cleavage sites of restriction endonucleases.
Met and Stop mark the beginning and end of translation.
Asn marks a mutation of amino acid Ala → Asn, which caused the removal of the NaeI
site in the C4 oligonucleotide.
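
The statement that the Ala to Asn mutation removes the NaeI site can be illustrated with a short check. This is a minimal sketch under stated assumptions, not the patent's procedure: the OligoC4 sequence is the 33-mer printed in the figure, `GCCGGC` is the NaeI recognition sequence, and the helper `reverse_complement` and the constant names are hypothetical. Because C4 is an antisense oligonucleotide, its reverse complement is compared with the coding strand, where the codons GGT AAC GGC AGC (Gly Asn Gly Ser) are expected.

```python
# Illustrative sketch only; helper and constant names are assumptions.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


# OligoC4 as printed in the figure (33-mer, carries the Asn codon).
OLIGO_C4 = "GGGGCTGGAGGCTGCCGTTACCGGGACCACCAC"

# NaeI recognition sequence (palindromic, so one strand is enough to search).
NAEI_SITE = "GCCGGC"

coding_strand = reverse_complement(OLIGO_C4)

print(coding_strand)
print("NaeI site present:", NAEI_SITE in OLIGO_C4)
print("Gly-Asn-Gly-Ser codons present:", "GGTAACGGCAGC" in coding_strand)
```

The same helper can be applied to the other antisense oligonucleotides in the list to compare them against the codon-optimised coding sequence.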
Fig. 6.

PCR synthesis of fragment IV

[Figure: fragment IV is built up in successive PCR rounds with Taq DNA polymerase from oligonucleotides B7, B3, B4, B5 and B6. Labelled products include B7/B3/B4/B5 and B7/B3/B4/B5/B6. The final product is digested with EcoRI and PstI to give Fragment IV, 131 bp.]
Fig. 8.







EP 0 704 527 B1 







